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Abstract — We consider a distributed antenna system where L 
antenna terminals (ATs) are connected to a Central Processor 
(CP) via digital error-free links of finite capacity Ro, and serve 
L user terminals (UTs). This system model has been widely 
investigated both for the uplink and the downlink, which are 
instances of the general multiple-access relay and broadcast 
relay networks. In this work we focus on the downlink, and 
propose a novel downlink preceding scheme nicknamed "Reverse 
Quantized Compute and Forward" (RQCoF). For this scheme 
we obtain achievable rates and compare with the state of the 
art available in the literature. We also provide simulation results 
for a realistic network with fading and pathloss with K > L 
UTs, and show that channel-based user selection produces large 
benefits and essentially removes the problem of rank deficiency 
in the system matrixj] 

I. System and problem definition 

We consider a distributed antenna system (DAS) with K 
user terminals (UTs) and L "antenna terminals" (ATs). All 
UTs and ATs have a single antenna each. The ATs are 
connected with a central processor (CP) via wired links of 
fixed rate i?o- We study the downlink scenario, where the CP 
wishes to deliver independent messages to the UTs. This is a 
simple instance of a broadcast relay network, where the ATs 
operate as relays. In this work we focus on the symmetric 
rate, i.e., all messages have the same rate and assume that the 
CP and all UTs have perfect channel state information (more 
general results are provided in |T|). If Rq oo, the problem 
reduces to the well-known vector Gaussian broadcast channel, 
the capacity region of which is achieved by Dirty Paper Coding 
(DPC). However, for fixed finite Rq, DPC and other widely 
considered linear precoding schemes cannot be applied in a 
straightforward manner. A simple DAS system, the so-called 
Soft-Handoff model, was investigated in |2|, by introducing a 
"compressed" version of DPC (CDPC), where the CP performs 
joint DPC under per-antenna power constraint and then sends 
the compressed (or quantized) codewords to the corresponding 
ATs via the wired links. While this scheme is expected to be 
near-optimal for very large Rq, it is generally suboptimal at 
finite (possibly small) Rq. Also, DPC is notoriously difficult 
to be implemented in practice, due to the nested lattice coding 
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construction and lattice quantization steps involved (See for 
example g, ||4))- 

Motivated by Compute-and-Forward (CoF) fS] (or quantized 
compute-and-forward (QCoF) |6|), we propose a novel coding 
strategy named Reverse QCoF (RQCoF) for the DAS downlink 
with finite backhaul link capacity Rq. In QCoF and RQCoF 
the coding block length n can be arbitrarily large but the 
shaping block length is restricted to 1 (scalar quantization ||^). 
However, we would like to point out that the same approach 
can be straightforwardly applied to CoF based schemes, where 
also the shaping dimension becomes large (in this case, we 
would refer to the scheme as Reverse CoF (RCoF)). 

A. Overview of QCoF 

Let Tjp — Z, mod pZ denote the finite field of size p, with 
p a prime number, denote addition over Zp, and g : Zp ^ M 
be the natural mapping of the elements of Zp onto {0, 1, 
1} C M. For a lattice A, let Qa(x) — argmin^g^{||x — 
A||} denote the associated lattice quantizer, V = {x e R" : 
Qa(x) — 0} the Voronoi region and define [x] mod A = x— 
Qa(x). For K e R, consider the two nested one-dimensional 
lattices As — {x = npz : z S Z} and Kc ^ {x — k,z : z £ 
Z}, and define the constellation set 5 = Ac n Vj, where Vg 
is the Voronoi region of Kg, i.e., the interval [— Kp/2). 
The modulation mapping m : lip — >■ 5 is defined by v ~ 
m{u) = [Kg{u)\ mod A^. The inverse function r7i^^(-) is 
referred to as the demodulation mapping, and it is given by 
u — m''^{v) = g~^{[v/K] mod pZ) with v £ S. 

Consider the (real-valued) L-user Gaussian multiple access 
channel with inputs {xe^i : i — 1, n} for ^ = 1, ...,L, output 
{yi : i = l,...,n} and coefficients h = (/ii, /i^)^ e M^, 
defined by 

L 

Ui = '^hexi^i + Zi, for i = l,...,n, (1) 

1=1 

where the z/s are i.i.d. ^ A/^(0, 1). All users encode their 
information messages {wi G Z^ : £ — 1, . . . , L} using 
the same linear code C over Zp (i.e., denoting information 
sequences and codewords by row vectors, we have ce = weG 
where G is a generator matrix for C), and produce their 



channel inputs according to 

xe.i = [m{ci,i) + di,i] mod A^, i = l,...,n, (2) 

where q ^ is the i-th symbol of ce and de/s are i.i.d. dithering 
symbols ~ Uniform(Vs), known at the receiver. The channel 
inputs Xi,i are uniformly distributed over Vs and have second 
moment SNR ^ E[|a;£_ip] — The receiver's goal is 

to recover a linear combination c = Q qeC£ of the transmitted 
users' codewords, for some coefficients qg e Zp. For this 
purpose, the receiver selects the integer coefficients vector 
a = (ai,...,aL)^ G Z'^ and produces the sequence of 
quantized observations 

Ui ^ m^^ ([Qa, {ayi - di)] mod A^) , (3) 

for j = 1, . . . , n. It easy to show Q that Q is equivalent to 

L 

= (09<!C<;,») ® ii, (4) 

£=1 

with qi = g~^{[ai\ modpZ). Here, Zi = m~^{[Q\^{e)] 
mod As) where e denotes the effective noise, capturing a 
Gaussian additive noise and non-integer penalty, and its vari- 
ance |(6| is 

erf = a"r(SNR^^I + hh"r)-ia. (5) 

By ||6[ Th. 1], the achievable computation rate of QCoF is 
given by 

i?Qc„F = logp - (6) 

Also, by ||5] Th. 4], the achievable computation rate of CoF 
is given by 

i?c„p(a?) = ^log(SNR/a2). (7) 

We showed in (6) that, for fixed large SNR ^ 1 and 
sufficiently large p (e.g., p > 251), the (|6]) and (j7]| differ 
approximately by the shaping gain, i.e., w 0.25 bits per real 
dimension. 

Remark 1: In order to achieve the CoF rate, p must grow to 
infinity in the lattice construction and the rank of the system 
matrix Q is the same as the rank of A over M, by |5 , Th. 11]. 

II. Reverse Quantized Compute- and-Forward 

The main idea is that each UT decodes a linear combination 
(over the finite field) of the messages sent by the ATs using 
QCoF. In short, we exchange the role of the ATs and UTs and 
use QCoF in the reverse direction. However, decoding linear 
combination of the information messages is useful only when 
these combinations can be shared such that the individual 
messages can be recovered, provided that the resulting system 
of linear equations is invertible over Zp. Since the UTs do 
not cooperate, sharing the decoded linear combinations is 
impossible in the downlink. Nevertheless, thanks to algebraic 
structure of QCoF (or CoF), the messages from the ATs can 
be the precoded versions of the original information messages 
and hence, using an appropriate invertible precoding over Zp 
at the CP, the effect of the linear combination can be undone 



at the transmitter, so that every UT obtains just its own desired 
message. We present coding strategies considered in this 
work, assuming the K = L and (real-valued) channel matrix 
H e R^^^. Let Q denote the system matrix whose elements 
in the ^-th row, denoted by qj = {qe,i, ...jqi.L), indicate the 
coefficients of the linear combination decoded at the £-th UT as 
given in Q. For the time being we assume that these matrices 
are full rank over Zp, although they may be rank deficient 
since each UT chooses its own linear combination coefficients 
independently of the other nodes. The case of rank deficiency 
will be handled later. Let zg be the discrete additive noise (over 
Zp) at the £-th UT. The detailed description of "reverse" QCoF 
(RQCoF) is as follows. 

• For the given Q, the CP precodes the user information 
messages {we e Zp : f = 1,...,L} using the inverse 
system matrix Q~^. The precoded L-dimensional vectors 
of information symbols to be transmitted by the ATs are 
given by 

{Hi,t, ■■; tiL,iV ^Q^^iwi,t,...,WL.i)''', for i = l,...,fc. 

(8) 

• The CP forwards each block fj,g — ...,^e^k) to the 
£-th AT, during n time slots, corresponding to the duration 
of a codeword sent on the wireless channel. Therefore, 
we have the rate constraint (k/n) \ogp < Rq. 

• After receiving k symbols, the £-th AT locally encodes 
its information symbols /x^ using the same linear code C 
over Zp (i.e., cg = G), and produces its channel input 
according to 

xg,i = [m{cg^^) + di,^] modA^, for i = l,...,n. 

(9) 

• By f6' Th. 1], the £-th UT can recover a noiseless linear 
combination of ATs' information symbols if R < logp — 
ma.xi{H{zi)}. This is given by 

= wg^i, for i = 1, . . . , fc. 

(10) 

Hence, the ^-th UT can successfully recover its desired 
message. 

The following rate is achievable by RQCoF: 

i?RQCoF = min{i?o,logp- max{i/(z£)}. (11) 

Similarly, from (|7|, we can get an achievable rate per user of 
RCoF 

i?RcoF = min{i?o, mjn{i?c„F(o'f J}}. (12) 

Finally, the achievable rate of RQCoF (or RCoF) is maximized 
by minimizing the variance of effective noise in (j5]l with 
respect to A subject to the system matrix Q is full rank 
over Zp. This problem was solved in |]6j using the LLL 
algorithm f7 |, possibly followed by Phost or Schnorr-Euchner 
enumeration (See |^) of the non-zero lattice points in a sphere 
centered at the origin, with radius equal to the shortest vector 
found by LLL. 



III. Compressed Integer-Forcing Beamforming the second moment of » is determined as 



In short, the idea underlying RQCoF is that each UT con- 
verts its own downlink channel into a discrete additive-noise 
multiple access channel over Zp. Since each UT is interested 
only in its own message, the CP can precode the messages 
using zero-forcing linear precoding over Zp, at no transmit 
power additional cost (unlike linear zero-forcing over M). It 
is known that the performance of CoF (and therefore QCoF) 
is quite sensitive to the channel coefficients, due to the non- 
integer penalty, since the channel coefficients are not exactly 
matched to the integer coefficients of linear combinations 
||6^|. The same problem arises in RQCoF (or RCoF), due to 
their formal equivalence. In [9], it was shown that integer- 
forcing linear receiver (IFLR) can eliminate this penalty by 
forcing the effective channel matrix to be integer. Here, we 
propose a new beamforming strategy named Integer-Forcing 
Beamforming (IFBF), that produces a similar effect for the 
downlink. 

We present the IFBF idea assuming Rq = oo, as the dual 
scheme of IFLR, and consider finite Rq later. In IFBF, the 
beamforming vectors W = [wi, wl] are chosen such that 
the effective channel matrix H = HW is integer-valued. 
Then, the channel matrix is inverted over by using RQCoF, 
as previously presented. In this case, since H G Z^^^, 
RQCoF does not suffer from the non-integer penalty. Further, 
we extend IFBF to the case of finite Rq by using quantization, 
as in done in [2|, where CP forwards the quantized sequences 
to the ATs for which the quantization noise is determined 
from standard rate-distortion theory bounds. It is assumed that 
H G M^^^ is full rank and the detailed description of IFBF 
is as follows. For a given A G Z^^^ (optimized later), the 
CP uses the beamforming matrix W — H^^A and the system 



matrix Q = [A] mod pZ as in Section I- A 



Assuming that Q is full rank over Zp, the CP produces the 
downlink streams — {xt_i : i = 1, . . . , n}, for £ = 1, . . . , i 
as follows. 

• The CP precodes the user information messages {w^ G 
Zp:^=l,...,i} using the inverse system matrix Q^^: 



(13) 



for i = 1, . . . , k. 
• The CP encodes the precoded information messages using 
the same linear code C over Zp (i.e., ce — fi(G) and 
produces the downlink stream according to 

Xi,i = [m^ce^i) + di,t] mod A^, for i = l,...,n. 

(14) 

Using the predefined W, the CP produces the precoded 
channel inputs {vi^i : i = 1, . . . ,n} using 

. . . , VL,i)~'^ = W(a;i,i, . . . , xla)'^, for i = 1, . . . , n, 

and forwards them to the ATs via the wired links. Consistently 
with our system definition, we impose a per-antenna power 
constraint equal to SNR (with suitable normalization). Hence, 



2] =SNR/max{||H-ia,f}, (15) 



which guarantees that the power of the signal transmitted from 
the £-th AT has the required power = SNR. The 

received signal at the £-th UT is given by 



— 3-£ X£_i 



for z = 1, . 



(16) 



Notice that thanks to the IFBF the non-integer penalty is equal 
to zero. So, every UT can recover its desired messages by 
decoding the linear combination of ATs' messages with integer 



coefficients as shown in ( 10 1. Finally, the achievable rate of 
IFBF with RQCoF can be obtained by numerically computing 
the entropy of discrete additive noise over Zp corresponding 
to effective noise ^ A/'(0, max^jj |H^^a^| p}) where the 
impact of power constraint is included in the effective noise. 
The following rate is achievable by IFBF with RQCoF: 



= logp - max{iJ(z^)} 



(17) 



for any full-rank matrix Q, where zt = 
mod As). From (j7|, the following rate is achievable by IFBF 
with RCoF: 

^.FBP - \ log(SNR/max{||H-ia,||2}) (18) 



for any full-rank integer matrix A. 

For the case of finite Rq, we propose a "compressed" IFBF 
(CIFBF) where the CP forwards the quantized channel inputs 
— Vi^i + zi^i for i = to the ^-th AT, where 

{zi^i : i — 1, . . . ,n} denotes the quantization noise sequence, 
with variance (quantization mean-square error) equal to (t|. 
From the standard rate-distortion theory, the CP can forward 
the {vi,i : i = 1, n} to the £-th AT if 



(19) 



where the index i is omitted for brevity. Using the well-known 
maximum entropy argument on ( [T9] l we have the bound 



Hvf.ve) < ilog(SNR/a|). 



(20) 



From (19l and (20i, we obtain cr| = SHR/2'^'^° and 
IE[|w£,z|1 = SNR7(1 + l/(22'Ro - i)), due to the power 
constraint. Accordingly, we have 

E[|x,,,|2] =SNR/max{||H-ia,|p}(l + l/(22«o-l)). (21) 



Also, the effective noise at the ^-th UT is given by 

L 



(22) 



fe=i 



where the second term captures the impact of quantization 
noise and its variance is 



= l + ||h,||2SNR/2 



2-Ro 



(23) 



Finally, the achievable rate of CIFBF with RQCoF can be 
obtained numerically computing the entropy of discrete ad- 
ditive noise over Zp corresponding to the effective noise 



e\ ^ A/'(0, CT^, ) where the impact of power constraint and 
quantization noise are included in the effective noise: 

g\ = max{||H-ia,||2}(l + (1 + ||h,|pSNR)/(22«« - 1)). 

(24) 

The following rate is achievable by CIFBF with RQCoF: 

^ciFBF = logp - max{iJ(z^)} (25) 

for any full-rank matrix Q, where zg = r7i^^([QA^(e^)] 
mod As). From the (j7|, the following rate is achievable by 
CIFBF with RCoF: 

i?ciFBP = i?,PBF-^niax{log(l + (l + ||h^|pSNR)/(22«"-l))} 

for any full-rank integer matrix A. Finally, the achievable rate 
is maximized by minimizing the max^H |H^^a£| p} subject to 
full-rank constraint. This problem can be thought of as finding 
the L linearly independent "shortest lattice points" of the L- 
dimensional lattice generated by H^^. This can be efficiently 
obtained using the LLL algorithm |7|. Specifically, for a given 
lattice A defined by A = {x = H^^z : z € Z^}, a reduced 
basis of lattice is obtained through a unimodular matrix U such 
that A = {x = H-^Uz : z e Z^}. Let F = H-^U generates 
the same lattice but has "reduced" columns, i.e., the columns 
of F have small 2-norm. The solution of the original problem 
can be chosen as = where denotes the i-th column 
of U. While finding the optimal reduced basis for a lattice 
(e.g., finding the optimal U) is an NP-hard problem, the LLL 
algorithm finds a good reduced basis with low-complexity |]7). 

Remark 2: In CIFBF, relays (i.e., the distributed antenna 
elements) have very low-complexity and are oblivious to 
codebooks since they just forward the received signals from 
CP, not requiring modulation and encoding. 

Remark 3: In terms of performance, it is worthwhile under- 
stand the impact of non-integer penalty and quantization noise 
depending on parameters i?o, SNR, and so on. As i?o oo, 
the effect of quantization noise vanishes and thus, CIFBF 
would be better than RCoF. However, when i?o is small, RCoF 
without beamforming may perform better than CIFBF since 
quantization noise would be severe in this case. A numerical 
result in a particular case is provided in Fig. [T] 

IV. Scheduling and Numerical Results 

For the sake of comparison with CDPC we consider the 
same Soft-Handoff model of ||2|, with L ATs and L UTs for 
which the received signal at the £-th UT is given by 

yt,i = xt,i + jxi^i^i + zi^i, (26) 

where 7 G [0, 1] represents the inter-cell interference level and 
Zi_i ^ CJ\f{0, 1). The extension of results in previous sections 
to the (complex-valued) Soft-Handoff model is easy and done 
in the usual way |[T|. In this example, thanks to the dual- 
diagonal structure of the channel matrix, the system matrix 
is guaranteed to have rank L. In Fig. [T] we compare various 
coding strategies where the upper bound and achievable rates 
for CDPC are provided by |2 |. It is remarkable that RCoF can 
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Fig. L SNR = 20dB. Achievable rates per user as a function of finite 
capacity Rq, for inter-cell interference 7 ~ Uniform(0.5, 1). 
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Fig. 2. Achievable rates per user as a function of SNRs, for finite capacity 
-Ro = 2 or 4 bits, inter-cell interference level 7 = 0.7, and p = 251 for 
RQCoF 



achieve the upper bound when Rq < 4 bits and outperforms 
the other schemes up to i?o ~ 6 bits. Notice that when 7 1 
(e.g., integer channel matrix), RCoF almost achieves the upper 
bound, showing better performance than other schemes. Also, 
from the Fig. [2] we can see that RCoF is a good scheme when 
i?o is small and SNR is high, i.e., small cell networks with 
finite-backhaul capacity. Not surprisingly, RQCoF approaches 
the performance of RCoF within the shaping loss of w 0.25 
bits/symbol, as already noticed in the uplink case |Q. 

For the RQCoF, there would be a concern on rank-deficiency 
of system matrices Q in particular when p is small, since 
every UT selects its own linear combination coefficients in- 
dependently of the other nodes. This problem can be avoid 
by scheduling since it can select a group of UTs (or ATs) 
for which the system matrix is invertible. In fact, this is a 
complex combinatorial optimization problem, which in some 
cases, can be formulated as the maximization of linear function 
over matroid constraint |10| and thus, greedy algorithm yields 
provably good performance. The following is the example that 



greedy algorithm is optimal. Consider a DAS system with 
K UTs and L ATs where K > L and we consider the 
user selection that finds the subset of UTs to maximize the 
symmetric rate subject to full-rank constraint of system matrix. 
Independently of the user selection algorithm, we can obtain 
the coefficients of the linear combination of the fc-th UT (e.g., 
fc-th row of Q) and the variance of effective noise (i.e., cr^^ 
for k = 1,...,K) that determines the achievable rate, for 
the given H e M'''''^. Let /C be the subset of row indices 
[1 : K]. Also, let Q(/C) denote the submatrix of Q consisting 
of fc-th rows for k £ IC. Assuming that Q has rank L, the user 
selection problem of finding L UTs can be formulated as 

argmin max{(T^^ : k £ JC} (27) 

KC[1:K] 

subject to Rankp(Q(/C)) = L (28) 

We first give the definition of matroid and subsequently, show 
that the above problem is equivalent to the maximization of 
linear function over matroid constraint. Matroids are structures 
that generalize the concept of linear independence for general 
sets. Formally, we have |10|: 

Definition 1: A matroid 7W is a tuple A4 = where 
57 is a finite ground set and I C 2^^ (the power set of il) is a 
collection of independent sets, such that: 

1) I is nonempty, in particular, (f) E I 

2) I is downward closed; i.e., if y E I and X C y, then 

X el 

3) if X,y e I, and \X\ < \y\, then 3yey\X such that 
XU{y}e I. 

Let fl = [1 : K] Siud I ^ {JC C [1 : K] : 
Q(/C) has linearly independent rows}. From Definition [T] 
Ai = (51,2^) forms a so-called linear matroid. Then, the 
optimization problem ([27]i-([28]l is equivalent to 

argmax l/c^e (29) 

ICC[1:K] 

subject to JCel (30) 

This can be easily proved by the fact that Q has rank L and 
constraint is matroid. Rado and Edmonds proved that the Best- 
In-Greedy algorithm (See Algorithm 1) finds an optimal solu- 
tion |[l0|. Detailed scheduling algorithms for various scenarios 
are omitted because of space limitation (See |1|). 

Algorithm 1 Best-In-Greedy Algorithm 

Input: M = {n,I) and Wk = l/cr^^ for k e [1 : K] 
step 0. Sort [1 : K] such that wi > W2 > ■ ■ ■ > wk 

Initially fc = 1 and K, = <j) 
step LIf Rankp(Q(/C U {k})) > Rankp(Q(/C)), then K. ^ 

/CU{fc} 
step 2. Set A: = fc + 1 
step 3. Repeat until Rankp(Q(/C)) = L 



In Fig. [3] we consider a DAS with channel matrix H € 
M^°^^, with i.i.d. Gaussian distributed elements ^ Af{Q,l). 
In our simulation we assumed that if the resulting system 
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Fig. 3. Achievable rates per user as a function of SNRs, for finite capacity 
Ro = 3 bits and p = 17 for RQCoF. 

matrix after greedy selection is rank deficient then the achieved 
symmetric rate of all users is zero, for that specific channel 
realization. Then, we computed the average achievable rate 
with user selection, by Monte Carlo averaging with respect to 
the random channel matrix. Random selection indicates that 5 
UTs are randomly and uniformly chosen out of the 20 UTs. 
As shown in Fig.|3] RCoF has the rank-deficiency when using 
random selection, although the rank of the resulting 5x5 
matrix over R is equal to 5 with probability 1 . However, it is 
remarkable that RQCoF with greedy user selection does not 
suffer from the rank-deficiency problem, even for relatively 
small values of p (e.g., p — 17). This is indicated by the fact 
that the gap from the RCoF is essentially equal to the the 
shaping loss, as in the case where the full-rank system matrix 
is guaranteed by assumption. 
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